WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 7 
G01N 33/48 



A2 



(11) International Publication Number: WO 00/54046 

(43) International Publication Date: 14 September 2000 (14.09.00)



(21) International Application Number: PCT/US00/06244

(22) International Filing Date: 10 March 2000 (10.03.00) 



(30) Priority Data:
60/123,586          10 March 1999 (10.03.99)          US



(71) Applicant (for all designated States except US): THE
GOVERNMENT OF THE UNITED STATES OF AMERICA,
as represented by THE SECRETARY, DEPARTMENT OF
HEALTH AND HUMAN SERVICES, THE NATIONAL
INSTITUTES OF HEALTH [US/US]; Office of Technology
Transfer, Suite #325, 6011 Executive Boulevard, Rockville,
MD 20852 (US).



(72) Inventor; and

(75) Inventor/Applicant (for US only): GE, Hui [CN/US]; 437
Upshire Circle, Gaithersburg, MD 20878 (US).



(74) Agent: NOONAN, William, D.; Klarquist, Sparkman,
Campbell, Leigh & Whinston, LLP, Suite 1600, One World Trade
Center, 121 SW Salmon Street, Portland, OR 97204 (US).



(81) Designated States: AE, AL, AM, AT, AU, AZ, BA, BB, BG,
BR, BY, CA, CH, CN, CR, CU, CZ, DE, DK, DM, DZ, EE,
ES, FI, GB, GD, GE, GH, GM, HR, HU, ID, IL, IN, IS, JP,
KE, KG, KP, KR, KZ, LC, LK, LR, LS, LT, LU, LV, MA,
MD, MG, MK, MN, MW, MX, NO, NZ, PL, PT, RO, RU,
SD, SE, SG, SI, SK, SL, TJ, TM, TR, TT, TZ, UA, UG,
US, UZ, VN, YU, ZA, ZW, ARIPO patent (GH, GM, KE,
LS, MW, SD, SL, SZ, TZ, UG, ZW), Eurasian patent (AM,
AZ, BY, KG, KZ, MD, RU, TJ, TM), European patent (AT,
BE, CH, CY, DE, DK, ES, FI, FR, GB, GR, IE, IT, LU,
MC, NL, PT, SE), OAPI patent (BF, BJ, CF, CG, CI, CM,
GA, GN, GW, ML, MR, NE, SN, TD, TG).



Published 

Without international search report and to be republished 
upon receipt of that report. 



(54) Title: UPA, A UNIVERSAL PROTEIN ARRAY SYSTEM 



(57) Abstract 

This invention relates to ordered arrays of molecules, for instance polypeptides or proteins. Such arrays, referred to as universal 
protein arrays, are described in both macro- and microarray formats. Methods of production and use of such arrays are also described. 
Also disclosed are kits for the use of such arrays and the preparation of probes for use with them.

UPA, A UNIVERSAL PROTEIN ARRAY SYSTEM

FIELD 

BACKGROUND 

- P , ay . Pivo ,„ role j al j ::: P ° f — - — - 

polymerase II. However an addiii , ■ " ' TF " H 3nd ™ A 

v^ijjudniaese/o/., Genes Dev., 10 2657-?fi)n ioo< n ■ 
We, 386, 569-577 1997^ Rmhk , , ' " 6; Ptashne and Gann, 

— : e ri i::;r v r ~ s - ~ ^ *-* 

identifying anticancer drugs. H^no^enesis and of 

»»dpro lc i„.RNAin,=rac,i„,(Fi,l dsa „ ds „„„, T , en oto »"0 "> prote.n-DNA 

(SAGE) (V lcules ; u 2 s seve "' ■"*•*• includins -** «- 
«■«.«. i»« tave J; I " °" 8on "' :tai ' te - te « i »a d*. «*. «, „,„ &ie „ ce , 274i 
acmig ractors are invo ved and hnw tu~* u



SUMMARY 

» — up* - a,:„ bc : c:r: e ' rep,ica,ion ' , ™ si *°- *- — . - 

polypeptide samples, W hich can be arr , • yS WlU indude at le ™ 1 0 

p«* — or , te Prewio „ x - " te ' 9o% ' - - '» — «. 
».<*„„,, SPICinc ram r ■ u ™ s " ,p,ion *' """" ,r, " scrip,i< """ 
from the following group of polypeptides: TFIIA, TFIIB, TBP, fTFIID trip tc

"■ «. <*„, Sp,. C<. H C,U,, G4-AH, 04-V P ,« 0 4 ^ ^ " « 

Tat, PC4-P, PC4-N PC4-C PC4 aq ^ Pl * G4 " E1A ' G4 - ,E . G4 " 

' <».)•. Top. , ( „ ati) , ASF, SR. OST.K and CST^K Th ' ^ ^ ' ^ ' 

stranded nucleic acids fDNA nr Rw^ • NA) ' double - 

contacted to an arrav of tarppt ™i S } ' he probe IS 

prior tons lhe bMhg ' ' ' ^ °' 0lhera ' S ' ™">» d fr » '»= «•>. fo, « 

P "■«ir(,.s,,mmuno se „,c,ry, i„ te „„, fluorescence, etc.). 
J his invention also provides kit*; for i a k«r 
protein intern as8ays ' J^" "* kb,! "" S —* ■ »• — « 
be included in the kit. ° r un,abe '«l) may also

^ Certain P robe '^''ng k it swi „a, S oi„cludeoneor m orea rr a y s forinstance 

substantially pure polypeptide molecules 30 ^ ° f 

labeling process or for. „p in « r • ouuers Ior use in the probe 

kS '° ™»» » *«« methods can be „.. otot , op „ , 

0.ber methods provided include meftods of analyzi,,,, , 0 | mli , v n , h . „. „ 
of an array of polypeptide samo.,, , t . * "''"' g ch «™™™Ms 

Arrays used in these method k CharaCtens,,cs of < he array of polypeptide samples. 
BRIEF DESCRIPTION OF THE FIGURES
Figure 1: Protein-Protein Interactions 

The universal protein array (UPA) provides quantitative detection of protein-protein
interactions at different salt wash stringencies. Fig. 1A is an autoradiograph detected
from the herein-described UPA that was incubated with 32P-labeled GST-K

A 1000 buffer (100 mM KCI). hlgh saIt 

Zcl^T^™ 1 ™^^"^ - IB. 

Figure 2: Protein-DNA and Protein-Ligand Interactions

probed with 32P-labeled nucleic acids (Examples 4 and 5) or with hh P l.H
stripped and equilibrated in buffer A100 as described in Example 2

Figure 3: Detection of ASF/SF2-Interacting Proteins Using a UPA
6H(K)AsZf2 Fi^A ^'"f 7^ fraCti ° nS ^ imeraCtion wi,h 32P-labeled 

Fig. 3B and Fig. 3C are autoradiographs of the binding patterns on the UPA after washing
with 100 mM KCl or 500 mM KCl, respectively.

Key to the abbreviations in Fig. 3A: CTD, the C-terminal domain of RNA polymerase II;
ASF, alternative splicing factor; GST-Nu, GST-nucleolin fusion; GST-K, GST fused with a
synthetic heart muscle kinase site.

DETAILED DESCRIPTION
I. Abbreviations and Definitions

A. Abbreviations
ASF: alternative splicing factor
Co-His: co-histones
CTD: the C-terminal domain of RNA polymerase II fused to GST
GST: Glutathione S-transferase
GST-K: GST fused with a synthetic heart muscle kinase site
GST-Nu: GST-nucleolin fusion
His-H1: histone H1
HMG1: high mobility group protein 1
Oct 1: B-cell specific activator
p52: novel transcription factor p52
p75: novel transcription factor p75
p75-C: C-terminal region of novel transcription factor p75
p300-C: transcriptional activator
PC4: positive cofactor 4
PCAF: a p300/CBP-associated factor that functions as a histone acetyltransferase
PCAF-C: C-terminal region of PCAF
Pol II: polymerase II
Sp1: class II gene activator

■»* ••<*. prepared from HeLa « „ uclear extracIS 
RPB5, RPB6, RPB8, RPB10α and RPB10β: subunits of RNA polymerase II fused to GST

RXR: retinoid-X receptor
TAF250: transcriptional coactivator
TBP: TATA-binding protein

TFIIA, TFIIB, TBP, fTFIID, TFIIE, TFIIF, fTFIIH: transcription factors IIA, IIB, IID,
IIE, IIF, and IIH

Topo I (wt), Topo I (mt), Topo I (wt)*, Topo I: various topoisomerase I polypeptides (see
Table 2)






TR: thyroid hormone receptor
UPA: universal protein array

B. Definitions



5 ^—jz~z:z"z::-~->'~ 

Pushed by Cford University Prcss , 2M0 OSBK 0-, « ' *" 

9); and Robert A. Meyers (ed ) M,,„„, „. , L ' d " ' W4 <ISBN <"32.02.82. 

^—.published yVCHp^ 7 '°^^ S ~^»^W,V c0 „ i 
' CH """"""m. Inc. 1995 (ISBN 1.56081.569.8) 

is miniaturized so as to reauire mipmw • A mieroarray" „ an a h 

» require m.croseopie examination for evaluation 
Within an array, each arrayed molecule is addressable in •, , ■ 
»d consistently determined within , he at , M! , MO „. ' '" "* ^ <» " ">**> 

»ays tbe location of each molecule 11 ' """ "** ^ » 

20 onto the array surface and IT. k " " " " » is 

~ ,a, 8 „. 0 «I ^ a™ " ' " " ~ - - 

»«.d be arra„ S ed in " 7" " S " P ""™' b "' -*« 

. .the, patterns (,.*. ,„ radially distributed lines or ordered clusters) 

<"*-. «, shape ofl™ " » . 

A key to one example array is shown in Tahle i r . • 
Example 1. This array has 48 aM Con ^« IO n of this array is described in 

Each address position can be referred to b a ""« l " ™« ^eled -,- through "ll- 

the array, for mstance, transcription factor HA (TFIIA) 
*r»»d.,b„„, address l aa „d.dd ress .b.coltaiv^addr^s mspairo[M , „. „
f ef^,„ by , henun ,„, l(1)(roundcentcredabove - - .»«-.■. 

.f »scH P ,i„, factor „ B an,B,. H„,«„ nlally , ' ta ° *" a 

«■* ( .) ,™ eh w . used , , he te column of Tabie c ;; , e ^ ™ 

— ^d,„ 6da , a 



B '"' li "S°""«-'«o".-A„,, S oe ia ,i„„ bcweei ,„ vosubs , anceso 
y. a probe binds to a polypeptide of an array of this invention if - u 

«ore), , deteaable moum of , be probi assoc . ies ^ 0 „,,„„,« „ 

;^-**-i^.»^d^,^ B ^^^ 

P«rs,s, , to the , my , s washed ,„ a „ isber ^ buffer 5M ^ ^ ^ 

The «™ - bindi „ s charlcleristics of a „ amy ^ a ^^^^ ^ 

p-- - „•„ , My have iis „ als of difeing imensiw ; ,^ ™ 

*e key of ,he ,„a y (wh „ b , iso , ht ^ rf , he ™- « 

b-ta, - « ,« fad aI e „ b address „ , he 

- «, h carried out ^ any exis ^' - p - * 

DMA (d,„ yribonuclllc acU): DNA , „ |ong cha . n 

„„,„ ,„ DNA po, ym « are four difrErE „, „ ;ach » 

- bas es (ato ,, e , guaniTO , cy , osi „ e „ d %mim) ^ ro ^ * 
in a polypeptide, or for a stop signal. The term "codon" is also used for the corresponding (and
complementary) sequences of three nucleotides in the mRNA into which the DNA sequence is
transcribed.

High throughput genomics: Application of genomic or genetic data or analysis techniques
that use microarrays or other genomic technologies to rapidly identify large numbers

'"**"'" A " " iS ° lMd " ^tfal ccpo™, ( Such as , acid molecofc „„, 

0„ E „„a„, 0 „ Je: ito^ ^ 
b*. * a polynueleo , ide (sucb _ MA or ^ ^ . 

0 ™P'""«.l5,50,I00oreve„200 n „cleo,ide s lo„ s . 

A„ „I ig o™c, e „, ide anaiog „ fcrs ,„ mo , ate thK fnnai<>n o|igonuc|eot . des 

^---:rcI^l-r.■ ,,,, * , -■*■ 

monomers coupled by amide CoemidP^ hnnw. u • ^mpnsea of 

bo»d s . ™*CP<P"*=>bond s ,s„ch asm™, acid „,„„„„«„ joined by peplide 

P™ta A m ol«„le ,h„ may bi „ d ,„ or inttnM wi ,„ 0 „ e „ , 

- « i. - „ ta ,„ y molecok ta „ „ sed B p .. y " y ' 'I 

nw*. * , h „ Probe motale . ,„ sp „ ific embofa » • « 

P*. be from fc a„d wrie d ^ ^ for 
protein fragments including domains or sub-domains, and mutants or variants of naturally occurring
proteins), or various types of other potential polypeptide

are referred to herein general* as Hgands (such as drugs toxins vent h $ 

5 minerals, and so forth). ' * anSlt, ° n State anaJ °gs> 

Usually, a probe molecule is detectable for use in probing an array of the invention. Probes
can be detectable based on their inherent characteristics (e.g., immunogenicity) or can be rendered
detectable by being labeled with an independently detectable tag. Specific examples of tags are fluorescent

b= used, fo, ,„,„„« mix , u „, 1 2 ' ' '" ™ m »f Pr«b« s „i„ 

with different tags, such that they can be simultaneously detected as distinct signals
(e.g., two fluorophores that emit at different wavelengths).

-.:=:r;r;-r.~rr,::r" 

w -** — * - «. « Probe ta „ ni bE as ;:: s ^ ■ m ^ 

instance, if the array will be used to • u • 06 usea "> assay the array. For 

= ==r=r— s; 

«- « m b<, d , m e„ B , „ beled ptobe stan<farts will be provMed h ihe kta ««■**■. 

Probing: A s ust d here i„, ,„= „™ >obing .. „„„ „ 

T ion> in "* r ,o *- - ~ - ~7 

interact with molecules immobilized on the array. Synonyms include "interrogating," "challenging,"
"screening" and "assaying" an array. Thus, a universal protein array of the invention is said to be
"probed" or "assayed" or "challenged" when it is incubated with a probe molecule (such as a
polypeptide, nucleic acid molecule, or ligand).

Protein/Polypeptide: A biological molecule expressed by a gene or other „• 

d P° n Jon of the individua polypeptides wirhin th. 
in so,,,™ lha , ,„ appli[d „ J, ^ ^ " ° f b '"<""S ,. „, „„ h 

» -se, „ e assoc zz ::::: ,n som! dr ™ k mv * » - ■ 

poo,.™; ^r^^-*""---*-.-*-* 

- ,» S e„:„ ,„ <bU " ni " d -*«> ■"*«"«- 
Purified: The term purified does not require absolute purity; rather, it is intended as a
relative term. Thus, for example, a purified protein preparation is one in which the specified

10 „, „• •" m0reof,he "" al P'<«'»»"««.<»f.h« P rep i ,„,io, 

This artificial combination can be accomplished by chemical synthesis or,
more commonly, by the artificial manipulation of isolated segments of nucleic acids, e.g., by genetic
engineering techniques.

(NH 4 ; 2 SOj and 1 M urea), for instance at room temperature for about «n 
^nutes. Usually, the stripped arra y wi„ be equilibrated ,„ . ^ J^^T 
mcubation with another probe molecule Pn ° r '° 

. he are j;;rr a r^ x~r;r ,rr s i:: r::;;^r of 

polypeptide of the UPA can be visualized by detecting the labeled probe molecule.
In variations of the UPA technology, the detectable probe is a specific
S .n g ,-or doub ,^ 

■nteracon with the polypeptides of the UPA. 

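Unless otherwise defined, all technical and scientific terms used herein have the same
meaning as commonly understood by one of ordinary skill in the art to which this invention
belongs. Although methods and materials similar or equivalent to those described herein can be
used in the practice or testing of the present invention, suitable methods and materials are
described below. In case of conflict, the present specification, including definitions, will control.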

Universal Protein Arrays 

tor analysis of protein-molecule interactions. Although dot blor ,„ i • . , 

provides a simple svstem f« r ^ * • . denaturing conditions, t 

molecules. 1108 nat ' Ve imeraCt, ° nS ^ «- P-be 

Known techniques fall far short of the UPA invention disclosed herein. In the far-western
blot analysis, protein fractions were usually analyzed by SDS-PAGE and electrotransferred
to a membrane, followed by denaturation and renaturation before probing with a radiolabeled protein
probe. On average, only 1-10% of the activity (without considering the loss of

with each other. * ' " ° 6 eaS " y quan,ifled and 

compared 

» ere .„ used only fw protein . prolc . n . mBnct . oii: ^ 
footprints and cross-linking assays were used for protein-nucleic acid (DNA or RNA) interactions.

Given that the major part of the human genome sequence has been identified, and that the
genome sequence is expected to be completed by the year 2003 (Collins et al., Science, 282, 682-689,
1998), the gene products will provide a rich source of proteins for UPA
— Tile UP A syslem not only provjdts „ , traa ^ '° 

*»».. wi* ,„ me „ dous po , enIia| for diwase „ . s . »"« » »» 

examples given are meant in no way to limit the invention.

Choice of polypeptide targets



™ mpfc Mra ,„ ,„ seo of lnlera are weli inown and ^ 

—.areola, tob „e. O, to , arg „ swi , lbEiden , ificdfr0mjoun , alirti e -122 

» » ,sc„ „ for inslance t0 Msis , da , a ^ sonjng ^ ^ - 

Any po„p ep , te e a » serve „ , argeIS for „ S5 ' 

re SP ,ra,ory pa.hw.y), proleins ^ „ bf ^ e <"° <* 

f.r.*.- f y,, 8l h^„ e , lononnd , idnalEST . |inkedgcntsandiheproieinsihey ^ t 
"X o, s „,„ pi „ g of polypeplides „„ be = 

By »ay of example, „, re appro>lmal!l ,„„ „ 
- *e hm S e„ 0 ™, eve. ^ ^ f„ by eact h „ mm gc „ ^ ^ 
«- d« vta!c , taleria , ^ ^ } rcseirch °*

10 Escherichia coli t etc.), and so forth. ' 
As demonstrated below (Examnle TV t ipa ; «• • 

UNA or RM-b,„d,„g dom.ms of a F o tt i„. ,„ ee „ ain upAs 

»d po lypep „ de dllc , ions of , a ^„ g |e „ glh „ a ser|es of ^ ^ - Of 

on,™ ragme01s of one pMd „ „ . f , mi|y of dose(y re|M ™ 

«. „ imPM , ,„ known „ suspecied 2,: » 

urge, p,„,e m „„ d te probe or p , obes ^ , o ^ ^ ^ »•» 

» Applies of ,„e mimal prolein array Kch „ 0 , ogy are |to . ^ m 

faelors , ^ fo „ owin8 ^ ^ 

~ w i;h „ sly , is of sueh ^ upA ma , ysis cou , d j|so ^ ^ 

p.yp, P „d= b,„ dmg c of „„„ iPle PTOin profiles expressed dari various d 

c .„, tions , . mll as in noma| pr anima , « 

> f'«""i'fr«™t,™ s geni c ,„im,lsorcul t u redee ll s . »=P™fita 
Po, ypt p, ide mys accor<li „ g „ 4 . . nvcmion ^ j]so ^ ^ to 

on genos « ^ discovered f , om , f(lr cxample , , igh . lh(00ghp „, ge *» 

sequencing, DNA microarrays or SAG P^rioi a i • ^ 

c . „ m y». or SAGE (Serial Analysis of Gene Expression) (Velculescu e/ fl/ 

SC T' :4M - 4a7 ' P °' Wide "~ — * - «* —ion a , so b Zt' 

Thii ^ hgy ca „ „„ ta « „ f o„„ w p rograsion of polypcplide cbm ;; ; 

*. « — d if fe re n, c.„«, or „ di _ 0lh „ , hM canc „ po ; ^ » 

This invention may be used to identify and analyze prognostic markers or markers of
therapy outcome for various diseases or abnormal conditions, such as cancers. Arrays compiled from
the proteins of hundreds of cancers derived from patients with known disease outcomes permit
binding or association assays to be performed on those arrays, to determine important prognostic
markers, or markers predicting therapy outcome, which are associated with polypeptide

Polypeptide arrays according to this invention may also be used to help assess the ability of
certain drugs or potential drugs to interact with target polypeptides, or the ability of such molecules
to block the interaction of other probes with arrayed polypeptides.

The UPAs of this invention can be used to investigate receptor binding
of known and suspected receptor molecules. Examples of receptors that can be investigated for
probe-specific binding by arrays according to this invention include but are not limited to
microorganism receptors (for instance, those found in fungi, protozoa, and bacteria, especially
bacterial strains that are resistant to antibiotics); hormone receptors (including those involved in
diabetes, growth regulation, vasoregulation, and so forth); and opiate receptors (involved in
biological responses, for instance to addictive drugs).

Also envisioned are arrays that are custom produced for the researcher, with an arrayed
collection of peptides tailored to a specific research project, research system, etc.

Not in any way intending to be limited to the list below, the following is a list of the types of
collections of polypeptides that can be arrayed on a UPA according to this invention: all or
substantially all the proteins encoded for by the genome of an organism; all or substantially all the
proteins encoded for by a chromosome of an organism; proteins expressed in a cell during a
particular growth phase or environmental condition; proteins expressed in a cell under a particular
abnormal state (such as cancer, disease, or infection); proteins expressed in cells at various times
during the progression of a disease or condition (e.g., during progression of a tumor, or development
of a chronic disease such as Alzheimer's); proteins expressed in a particular cell type; proteins from a
particular protein family (e.g., DNA polymerases, cell surface proteins, transmembrane proteins or
fragments [such as soluble fragments] thereof, oncogene proteins, tumor suppressor proteins, and so
forth); proteins that show sequence homology to each other; proteins that share secondary structural
characteristics; proteins that associate to form multimeric complexes (e.g., the subunits of a ribosome
or a membrane ATPase); viral epitopes; domains of proteins; proteins from different species; and
collections of fragments of any of these protein collections.

Production of substantially pure target polypeptides

Polypeptides for use as targets on the subject arrays can be produced by any technique that
yields native protein. These techniques in general include expression from engineered DNA
constructs, extraction from native samples (e.g., clinical samples), or de novo synthesis of
oligopeptide or polypeptide fragments.

Expression of the target polypeptides can be carried out using well known techniques. For
instance, partial or full-length cDNA sequences, which encode the protein of interest as a target on
the UPA, may be ligated into bacterial expression vectors. Methods for expressing large amounts of
protein from a cloned gene introduced into Escherichia coli (E. coli) may be utilized for the
production and purification of intact, native target proteins. Methods

S»=h f „s,o„ p,oK,„ s ™ y oe made ,„ ||rge raounis md ^ V°*. ,oso). 

; r pro : :~ ; m ~ >— - * -X 

described by Sambrook et al. (In Molecular Cloning: A Laboratory Manual, Ch. 17, CSHL, New
York, 1989). Vector systems suitable for the expression of lacZ fusion genes include the pUR series
of vectors (Ruther and Muller-Hill, EMBO J. 2:1791, 1983), pEX1-3 (Stanley and Luzio, EMBO J.
3:1429, 1984) and pMR100 (Gray et al., Proc. Natl. Acad. Sci. USA 79:6598, 1982). Vectors suitable
for the production of intact native proteins include pKK177-3 (Amann and Brosius, Gene 40:183,
1985) and pET-3 (Studiar and Moffatt, J. Mol. Biol. 189:113, 1986).

Choice of array format and substrate

M „, ^ ,„ delec , „ bound probe 0 „ , he upA , hough quiniifa[ion p™. 

2;; - - — « * - ~«* . ^ Mowcknoiogy c. mpMits , ; :r, he 

~JT FAST 7 s " dei sys,tm <s — — • ■>-* — >. ~* d 

■ p«tth ofpolymer on (he surface „f . gtes s „ de Ipora " s 
.» g «h™,. a ,„ g e,„„ a,. m . y should be dlscr „ 5i .„ t|]ai 

— — sis „„ s „r« 1S h 00rins , a , gtls , ei , h „ by ^ naw - 
5 •™»S' m <»'»"*="..c-.a m y is s»chtai„divid„a l spo ts c,nl«di s ,i„.u isl , rf , t

=r ,™„.,,«,,..,«,„_,~t == 

■5 TI »"'»™l,„„„ 1 s„„, n « css „ y , othei . 1 , vem . onitlioiish 

In n,ic™„, y UPAs, , con™ fMllre h lhc sn , a „ si2e 

»n,™ o,„ W0 „ individual arm w ff ^ spot _ ^ » ,. 

40,000 SDOt , Using m ,„ ofabric „ ion ^ b 

•mr of, „,,„<,„ differem „ rgtt sit& T „ e power of mkromav _ foimii P 

Th« „oun, of polyp^fc ,„ gH samp|= |Ja , . io ^ adte 

., , heamy fonM , osei For ins[ancs n 

~ . - ta maCTOamy , By „ ay of ex j mdi 

» ^ on a mi „„„ ay wilI osually be kx ±m , pmo| ejch * 

.8 pn.01, a to , „ pmol , aboul „ „„,„, ^ „ , pmo| ^ QJ pmo| _ ^ ~ • — 

in acton. ,h= s„rfac, a™ of ^ appllcation ^ ^ 

(having a smaller surface area).



The , a r a „ po, yP ep, We itself(eg., the „„ e , h „ „. „ 

- *- -*L „ «. ma „„.,, » „ ^ f „ „„ of „ di ^ ski] , , n ^ m k — 
taermme of range of <WB « molecule am „ mB , ha , p , odoce ilMrptMWe ^ 

A„o,her „,„ ,„ oesclbe a „ , my b ils density . , ht „ umb£r of ^ ^ _ 
felled S u,f«e are, For „,.cr. a rr ay s of the current ^ ^ y .„ 

=™,en „„, larg e, per !qil „ ed decimet „ (w ^ adtes 

- .*«.> to a o» ut » ,„ 8 e, s ^ ^ „„„ mel „ (50 , argeu wJthin a ; cm 

r.g,.« or , h . *«» Por rn.ero^, , my denslIy wi „ usua| , y be onc j 
«».».«, or .ore. for i„ SIa „ce abo« 50, .„„„, , 00, ab „ u , 2 „„, ,c„ 300, .he,, 400. a bo«, 500 

*« ,o»o, a .. ut ls „o, abou , 2 , 500 , , b0 „, 5W abou , |0 000> abou _ abou _ ■ 

more targets per square centimeter.
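
By way of illustration only, the density figure discussed above (the number of targets within a specified surface area) can be worked through as simple arithmetic. The sketch below is not part of the disclosed method; the function name `array_density` and the example figures (48 targets in a 2 cm by 3 cm region) are assumptions chosen purely for demonstration.

```python
def array_density(num_targets: int, area_cm2: float) -> float:
    """Return array density in targets per square centimeter."""
    if area_cm2 <= 0:
        raise ValueError("area must be positive")
    return num_targets / area_cm2


# Hypothetical example: 48 targets spotted within a 2 cm x 3 cm region.
density = array_density(48, 2.0 * 3.0)
print(density)  # 8.0
```

The same calculation applies to either format; only the number of targets and the arrayed area change.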

Application of targets to arrays

Targets on the array may be made of oligopeptides, polypeptides, proteins, or fragments of
these molecules. Oligopeptides, containing between about . and about 50 linked amino acids, can be
synthesized readily by chemical methods. Photolithographic techniques allow the synthesis of
hundreds of thousands of different types of oligopeptides to be separated into individual spots on a
single chip, in a process referred to as in situ synthesis, as has been done with oligonucleotide arrays.
Longer polypeptides or proteins, on the other hand, contain up to several thousand amino
acid residues, and are not as easily synthesized through in situ chemical methods. Instead,
polypeptides and proteins for use in UPAs are usually expressed using one of several well known
cellular expression systems, including those described above. Alternatively, proteins can be isolated
from their native environment, for instance from tissue samples or environmental samples, or from
expression chambers in the case of engineered expressed polypeptides. After extraction and
appropriate purification, the polypeptide can be deposited onto the array using any of a variety of
techniques.

In the methods disclosed in this application, target polypeptides can be delivered to the
substrate of the array by various different mechanisms. One is by flowing within a channel defined
on predefined regions of the array substrate. Typical "flow channel" application methods for
applying the polypeptides to arrays of the present invention are represented by dot-blot or slot-blot
systems (see U.S. Patents No. 4,427,415 and 5,283,039). One alternative method for applying
the targets to the array substrate is "spotting" the target polypeptide on predefined regions (each
corresponding to an array address). In a spotting technique, the target molecules are delivered by
directly depositing (rather than flowing) relatively small quantities of them in selected regions. For
: move om add,ess ,o *-* - — — * — y

«- «» Typ.c, a, w ^ m ink . e , prjmcr or a » 
» the subs™. ,„ „, he , embodiments, , he d . . l ifc M . 

Usually the urge, po ly pep„des are dcposiK „ „„ ,„ e 

™»-»«''*'p*««*-*- w . to -d rfltabhdl , plK11 , aill .^- 

»s,„s buffers of me UPA sy s,em ( , g „ low or high tato „ sin>pi ~ 

*»» 5 /, no more than 3K of the „o iy „ tide „„ the , my „ ,„ e jn< , o 
washed off using buffers of the UPA system.
Depending on the array substrate used, the substrate alone may substantially irreversibly

Peptides. Examples of , inking processes are ^ tQ those tf ^ fa ^ ^ ^ 
su strates that require such a linking process jn ^ to bind ^ 

polypeptides optionally may be attached to the array substrate through linker molecules.

In certain embodiments, the non-sample regions of the array surface (those regions

of the probe molecules directly to the array surface.

It is beneficial in certain embodiments to apply a known amount of each target polypeptide

»PP , ,. eacH spot. Quantify. and ^ ^ 

-d„ s ,ff, niry ^een the different arge, M_« of ,e amount of J* 2« 
proteins may be carried out through many techniques well known in the art. These include

Many other techniques could be used to measure the amount of a target protein present in a
sample. For instance, the amount of target protein in a sample could be measured using a

pooled mixture of more than one substantially pure target polypeptide
All of the addresses on the array may contain pools of polypeptides, or only some of the addresses,
depending on the use of the array. For instance, in some circumstances it may be desirable to array a
target polypeptide associated with one or more non-target polypeptides, for instance a stabilizing
polypeptide or linker molecule. In addition, the native conformation of certain binding sites on
proteins can only be assayed for probe binding when the target polypeptide is associated with other
molecules, for instance when the target polypeptide natively exists as one subunit of a multimeric
complex. Pooled arrays of the current invention include those on which one or more of the addresses
contains a multimeric polypeptide complex. In the case of such an array, it is envisioned that
different probe molecules may bind to different polypeptides within the complex of "target"
polypeptides.

Although the identity of each probe in the pooled mixture at a specific address is known, the
individual probes in the pool are not "separately addressable." The binding signal from a pooled
address is the binding signal of the set of different (but mixed or associated) polypeptides occupying
that address. In general, an address is considered to display binding of a probe molecule if at least
one polypeptide occupying the address binds to the probe molecule.

Arraying pooled samples is also a powerful tool in high-throughput technologies for
increasing the information that is yielded each time the array is assayed. Methods for analyzing
signals from arrays containing pooled samples have been described, for instance in U.S. Patent No.
5,744,305, incorporated herein by reference in its entirety.
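
The rule stated above for pooled addresses (an address displays binding if at least one polypeptide occupying it binds the probe) can be sketched as follows. This is an illustrative sketch only and is not the analysis method of the cited patent; the function name, the address labels, and the pool contents are hypothetical.

```python
from typing import Dict, List, Set


def addresses_showing_binding(
    pools: Dict[str, List[str]], binders: Set[str]
) -> List[str]:
    """Return addresses where at least one pooled polypeptide binds the probe."""
    return sorted(
        address
        for address, polypeptides in pools.items()
        if any(p in binders for p in polypeptides)
    )


# Hypothetical layout: address -> polypeptides pooled at that address.
pools = {
    "1a": ["TFIIA", "TFIIB"],
    "1b": ["TBP"],
    "1c": ["PC4", "ASF"],
}
# Hypothetical set of polypeptides assumed to bind the probe.
binders = {"TFIIB", "ASF"}

positive = addresses_showing_binding(pools, binders)
print(positive)  # ['1a', '1c']
```

Note that a positive pooled address identifies the pool, not the individual polypeptide responsible for the signal, consistent with the pooled members not being separately addressable.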

Choice of probe molecule(s)

Any molecule that might bind to or interact with one or more polypeptides can be used as a
probe with the disclosed arrays. In specific embodiments of the current invention, probes may be
from different molecular classes (e.g., nucleic acids, oligo- or polypeptides, or various types of
ligands). Probes (especially those that are polymeric chains) may be of various lengths, and different
results may be obtained from the same array by using related probe molecules of different length.
Likewise, varying the sequence of polymeric chain probes may provide valuable binding data.

Though in many embodiments of the invention a single type of probe molecule (for instance
one protein) at a time will be used to assay the array, in some embodiments, mixtures of probes will
be used simultaneously, for instance mixtures of two proteins or two nucleic acid molecules.
Simultaneous multiple-probing (e.g., double-probing) can be used to detect either competitive binding
or binding systems that require the interaction of more molecules than just one polypeptide target and
one probe molecule.

Labeling and detection of probe molecule(s)

Usually, probe molecules used to assay the disclosed UPAs are detectable. Probes can be
detectable based on their inherent characteristics (e.g., immunogenicity) or can be rendered
detectable by being labeled with an independently detectable tag. Such tags include fluorescent or
luminescent molecules that are attached to the probe, or radioactive monomers or molecules that can
be added during or after synthesis of the probe molecule. Other tags may be immunogenic sequences
(such as epitope tags) or molecules of known binding pairs (such as members of the
strept/avidin:biotin system). Other tags and detection systems are known to those of skill in the art,
and can be used in the present invention.

Labeling different probes with different tags enables simultaneous detection of binding of
two or more probes to the polypeptides of an array. Multiple-label challenges to an array of this
invention can also be used to examine any competitive binding between the two probes on different
polypeptides of the array. For competitive binding assays, however, only one of the probes needs to
be detectable.
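
As an illustrative aid, one simple way to express competitive binding when only one probe is detectable is the percentage by which the labeled probe's signal at an address falls when the competitor is present. The sketch below is an assumption made for demonstration, not a calculation prescribed by this disclosure; the function name, address labels, and signal values are hypothetical.

```python
from typing import Dict


def percent_competition(alone: float, with_competitor: float) -> float:
    """Percent reduction in labeled-probe signal caused by a competitor."""
    if alone <= 0:
        raise ValueError("signal without competitor must be positive")
    return 100.0 * (alone - with_competitor) / alone


# Hypothetical signals for the labeled probe at two addresses,
# measured without and with an unlabeled competitor probe.
labeled_alone: Dict[str, float] = {"1a": 1000.0, "1b": 800.0}
labeled_plus_competitor: Dict[str, float] = {"1a": 250.0, "1b": 800.0}

competition = {
    address: percent_competition(
        labeled_alone[address], labeled_plus_competitor[address]
    )
    for address in labeled_alone
}
print(competition)  # {'1a': 75.0, '1b': 0.0}
```

In this hypothetical data set, the competitor displaces most of the labeled probe at address 1a and none at address 1b.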

Computer assisted (automated) detection and analysis of UPAs

The data generated by assaying a universal protein array according to this invention can be 
analyzed using known computerized systems. For instance, the array can be read by a computerized 
"reader" or scanner and the quantification of the binding of probe to individual addresses on the array 
carried out using computer algorithms. Such analysis of the array can be referred to as "automated 
detection" in that the data is being gathered by an automated reader system. 

In the case of labels that emit detectable electromagnetic waves or particles, the emitted light
(e.g., fluorescence or luminescence) or radioactivity can be detected by very sensitive cameras, 
confocal scanners, image analysis devices, radioactive film or a Phosphoimager, which capture the 
signals (such as a color image) from the array. A computer with image analysis software detects this 
image, and analyzes the intensity of the signal for each probe location in the array. Signals can be 
compared between spots on a single array, or between arrays (such as a single array that is 
sequentially probed with multiple different probe molecules). 
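
The per-spot intensity analysis described above can be sketched as follows. This is only an illustrative sketch, not the software of any particular reader: the function name `quantify_spots`, the square tiling of the image into spots, and the use of the image-wide median as the background estimate are all assumptions made for the example.

```python
from statistics import mean, median


def quantify_spots(image, spot_size):
    """Return the background-corrected mean intensity of each spot.

    image     -- 2D list of pixel intensities (rows of equal length)
    spot_size -- edge length, in pixels, of the square region assigned to a spot

    Assumption: most pixels are dark, so the median of all pixels is used
    as a crude background estimate. Negative results are clipped to zero.
    """
    n_rows = len(image) // spot_size
    n_cols = len(image[0]) // spot_size
    background = median(p for row in image for p in row)

    intensities = {}
    for r in range(n_rows):
        for c in range(n_cols):
            pixels = [
                image[y][x]
                for y in range(r * spot_size, (r + 1) * spot_size)
                for x in range(c * spot_size, (c + 1) * spot_size)
            ]
            intensities[(r, c)] = max(mean(pixels) - background, 0.0)
    return intensities


# Hypothetical 4 x 4 pixel image holding a 2 x 2 grid of spots;
# only the top-left spot carries signal.
example_image = [
    [110, 110, 10, 10],
    [110, 110, 10, 10],
    [10, 10, 10, 10],
    [10, 10, 10, 10],
]
example_spots = quantify_spots(example_image, spot_size=2)
```

The resulting dictionary is keyed by zero-based (row, column) spot indices, so values from one array, or from the same array probed sequentially with different probes, can be compared key by key.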

Computer algorithms can also be used for comparison between spots on a single array or on 
multiple arrays. In addition, the data from an array can be stored in a computer readable form. 

Certain examples of automated array readers (scanners) will be controlled by a computer 
and software programmed to direct the individual components of the reader (e.g., mechanical 
components such as motors, analysis components such as signal interpretation and background 
subtraction). Optionally, software may also be provided to control a graphical user interface and 
one or more systems for sorting, categorizing, storing, analyzing, or otherwise processing the data 
output of the reader. 

To "read" an array according to this invention, an array that has been assayed with a 
detectable probe to produce binding (e.g., a binding pattern) can be placed into (or onto, or below, 
etc., depending on the location of the detector system) the reader and a detectable signal indicative of 
probe binding detected by the reader. Those addresses at which the probe has bound to immobilized 
polypeptide sample provide a detectable signal, e.g., in the form of electromagnetic radiation. These 
detectable signals could be associated with an address identifier signal, identifying the site of the 
complex. The reader gathers information from each of the addresses, associates it with the address 
identifier signal, and recognizes addresses with a detectable signal as distinct from those not 
producing such a signal. The reader is also capable of detecting intermediate levels of signal, 
between no signal at all and a high signal, such that quantification of signals at individual addresses 
is enabled. 
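
As a rough illustration of how a reader might associate each signal with an address identifier and distinguish no signal, intermediate signal and high signal, consider the sketch below. The address format (row number followed by a column letter, as in "12e"), the function names and the two threshold parameters are assumptions chosen for the example; the disclosure does not prescribe particular thresholds.

```python
import string


def address_of(row_index, col_index):
    """Build an address identifier such as '12e' from zero-based indices."""
    return f"{row_index + 1}{string.ascii_lowercase[col_index]}"


def classify_addresses(signals, low, high):
    """Associate each signal with its address and a coarse signal level.

    signals -- dict mapping (row_index, col_index) to a measured intensity
    low     -- intensities below this value are treated as no signal
    high    -- intensities at or above this value are treated as high signal
    """
    report = {}
    for (row, col), value in signals.items():
        if value < low:
            level = "none"
        elif value < high:
            level = "intermediate"
        else:
            level = "high"
        report[address_of(row, col)] = (value, level)
    return report


# Hypothetical intensities for three addresses of a 12 x 8 array.
example_report = classify_addresses(
    {(11, 4): 95.0, (0, 0): 2.0, (2, 1): 40.0}, low=5.0, high=50.0
)
```

Keeping the measured value alongside the coarse label preserves the quantitative information while still allowing bound and unbound addresses to be separated at a glance.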

Certain readers that can be used to collect data from the arrays of this invention, especially 
those that have been probed using a fluorescently tagged molecule, will include a light source for 
optical radiation emission. The wavelength of the excitation light will usually be in the UV or visible 
range, but in some situations may be extended into the infra-red range. A beam splitter can direct the 
reader-emitted excitation beam into the objective lens, which for instance may be mounted such that it 
can move in the x, y and z directions in relation to the surface of the array substrate. The objective 
lens focuses the excitation light onto the array, and more particularly onto the (polypeptide) targets 
on the array. Light at longer wavelengths than the excitation light is emitted from addresses on the 
array that contain fluorescently-labeled probe molecules (i.e., those addresses containing a 
polypeptide to which the probe binds). 

In certain embodiments of the invention, the array may be movably disposed within the 
reader as it is being read, such that the array itself moves (for instance, rotates) while the reader 
detects information from each address. Alternatively, the array may be stationary within the reader 
while the reader detection system moves across or above or around the array to detect information 
from the addresses of the array. Specific movable-format array readers are known and described, for 
instance in U.S. Patent No. 5,922,617, hereby incorporated in its entirety by reference. Examples of 
methods for generating optical data storage focusing and tracking signals are also known (see, for 
example, U.S. Pat. No. 5,461,599, hereby incorporated in its entirety by reference). 

For the electronics and computer control, a detector (e.g., a photomultiplier tube, avalanche 
detector, Si diode, or other detector having a high quantum efficiency and low noise) converts the 
optical radiation into an electronic signal. An op-amp first amplifies the detected signal and then an 
analog-to-digital converter digitizes the signal into binary numbers, which are then collected by a 
computer. 
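
The amplify-then-digitize chain can be modeled in a few lines. This is a simplified sketch under stated assumptions: an ideal linear amplifier with a fixed gain, an ideal unsigned converter with a reference voltage `v_ref` and a resolution of `bits` bits, and clipping of out-of-range inputs. All names and parameter values are hypothetical.

```python
def digitize(detector_voltage, gain, v_ref, bits):
    """Amplify a detector voltage and convert it to an unsigned ADC code.

    The amplified voltage is clipped to the range 0..v_ref and then mapped
    linearly onto the integer codes 0..(2**bits - 1).
    """
    amplified = detector_voltage * gain
    clipped = min(max(amplified, 0.0), v_ref)
    max_code = (1 << bits) - 1
    return round(clipped / v_ref * max_code)


# Hypothetical 8-bit converter with a 5 V reference and an amplifier gain of 2.
code_dark = digitize(0.0, gain=2.0, v_ref=5.0, bits=8)       # no light detected
code_mid = digitize(1.0, gain=2.0, v_ref=5.0, bits=8)        # 2 V after the amplifier
code_saturated = digitize(4.0, gain=2.0, v_ref=5.0, bits=8)  # clipped at v_ref
binary_word = format(code_mid, "08b")  # the binary number passed to the computer
```

Clipping makes saturation explicit: any address whose code equals the maximum value may have exceeded the range of the converter, and would typically need a shorter exposure or a lower gain to be quantified.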



III. Examples 

Example 1: Preparation of a UPA 
Methods and Materials 

To identify target proteins to which the transcriptional coactivator p52 will bind, the protein 
array system for which a target arrangement key is shown in Table 1 was provided. The general 
transcription factors, activators and coactivators arrayed were overexpressed either in bacteria, 
baculovirus or in mammalian cells and purified to near homogeneity as previously described (Chiang 
et al., EMBO J., 12, 2749-2762, 1993; Kershnar et al., J. Biol. Chem., 18, 34444-34453, 1998; Luo 
et al., Cell, 71, 231-241, 1992; Jackson and Tjian, Proc. Natl. Acad. Sci. USA, 86, 1781-1785, 1989; 
Ge et al., Methods Enzymol., 274, 57-71, 1996). The serine-arginine (SR) protein fraction was 
prepared from HeLa cell nuclear extracts essentially according to Zahler et al. (Genes Dev., 6, 
837-847, 1992). GST-nucleolin fusion protein (GST-Nu, addresses 12e/f) was prepared by overexpressing 
plasmid GST-HNB (provided by Dr M. Srivastava), which contains nucleolin coding sequence 
positions 290-707, in bacteria and purified on a glutathione-Sepharose column. Glutathione 
S-transferase fused to a HMK site (RRASV) (GST-K) (Ge et al., Mol. Cell, 2, 751-759, 1998) was used 
as a negative control in the experiments. 

An average of 7.5 pmol (normalized by Bio-Rad protein assay, Bio-Rad, Hercules, CA) of 
each of the 48 highly purified proteins (or fractions) was spotted on a 12 x 8 cm nitrocellulose 
membrane using a 96-well dot blot apparatus (Bio-Rad, Hercules, CA). This apparatus provides 
sample application to a membrane to form an array arranged in twelve rows and eight columns. The 
arrangement of the polypeptide targets in the array is shown in Table 1, which corresponds to the 
array results shown in Figures 1 and 2. Each sample was duplicated in two adjacent wells to provide 
a useful internal control. 

Each sample was diluted to 100 µl with buffer A100 (100 mM KCl, 10% glycerol, 20 mM 
HEPES Na pH 7.9, 0.2 mM EDTA, 10 mM 2-mercaptoethanol and 0.5 mM PMSF) and duplicated in 
two adjacent wells. Each well was rinsed with 2 x 500 µl buffer A100 and the vacuum kept for 3-5 
minutes. After removal from the dot blot apparatus, the protein array was rinsed with two changes of 
buffer A100. 
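
For orientation, the molar concentration of a diluted sample can be estimated from the amounts given above. The sketch assumes, for illustration only, that the full 7.5 pmol is contained in a single 100 µl application; the text does not state how the material was divided between the duplicate wells, and the function name is hypothetical.

```python
def concentration_nM(amount_pmol, volume_ul):
    """Convert an amount in pmol and a volume in microliters to nanomolar.

    1 pmol per microliter equals 1 micromolar, i.e. 1000 nM.
    """
    return amount_pmol * 1000.0 / volume_ul


# Assumed case: 7.5 pmol of target protein in 100 microliters of buffer A100.
applied_nM = concentration_nM(7.5, 100.0)
```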
Table 1^a

Row | a/b               | c/d               | e/f               | g/h
----+-------------------+-------------------+-------------------+--------------------
 1  | (1) TFIIA         | (2) TFIIB         | (3) TBP           | (4) f:TFIID
 2  | (5) TFIIE         | (6) TFIIF         | (7) f:TFIIH       | (8) Pol II
 3  | (9) RXR           | (10) TR           | (11) Oct 1        | (12) Sp1
 4  | (13) G4-94        | (14) G4-147       | (15) G4-AH        | (16) G4-VP16
 5  | (17) G4-CTF       | (18) G4-Sp1       | (19) G4-E1A       | (20) G4-IE
 6  | (21) G4-Tat       | (22) PC4-P        | (23) PC4-N        | (24) PC4-C
 7  | (25) PC4-AS       | (26) PC4-m1       | (27) PC4-m2       | (28) PC4-m3
 8  | (29) PC4-m4       | (30) PC4-m5       | (31) PC4-m6       | (32) PC4-m7
 9  | (33) PC4-wt       | (34) p52          | (35) p75          | (36) p75-C
10  | (37) p300-C       | (38) PCAF         | (39) PCAF-C       | (40) TAF250
11  | (41) Topo I (wt)  | (42) Topo I (mt)  | (43) Topo I (wt)* | (44) Topo I (nati)
12  | (45) ASF          | (46) SR (+nucl)   | (47) GST-Nu       | (48) GST-K

^a Abbreviations used in Table 1 are explained above, in the Abbreviations section (1A). 



Each sample was duplicated in two adjacent wells. The actual size of the membrane is 12 X 
8 cm (height X width) with eight columns and twelve rows. 
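
The layout of Table 1 follows a regular pattern: four samples per row, each occupying a pair of adjacent columns. Under that reading, the address of a sample can be derived from its number, as in this short sketch (the function and constant names are illustrative assumptions, not part of the disclosure):

```python
COLUMN_PAIRS = ("a/b", "c/d", "e/f", "g/h")


def sample_address(sample_number):
    """Return the address (row plus column pair) of a sample numbered 1 to 48."""
    if not 1 <= sample_number <= 48:
        raise ValueError("sample_number must be between 1 and 48")
    row_index, pair_index = divmod(sample_number - 1, len(COLUMN_PAIRS))
    return f"{row_index + 1}{COLUMN_PAIRS[pair_index]}"


# Addresses quoted in the text: PC4-P is sample 22, PC4-m5 is sample 30,
# and GST-Nu is sample 47.
addresses = {n: sample_address(n) for n in (1, 22, 30, 47, 48)}
```

This reproduces the addresses used in the Examples, such as 6c/d for PC4-P and 12e/f for GST-Nu.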

Example 2: Removal of probe molecules from the UPA. 

Methods and Materials 

The same universal protein array that was prepared in Example 1 was reused with a protein 
probe (Example 3), a dsDNA probe (Example 4), a ssDNA probe (Example 4), an RNA probe 
(Example 5) and a ligand probe (Example 6). After each use, the filter was stripped with buffer A 
containing 1 M (NH4)2SO4 and 1 M urea at room temperature for 30-60 minutes. Then the stripped 
array was equilibrated with buffer A100 before being incubated with another probe. 

Example 3: Interaction with a protein probe. 

Methods and Materials 

Purified GST-K-p52 protein (Ge et al., Mol. Cell, 2, 751-759, 1998) was labeled by heart 
muscle kinase (HMK) in a 50 µl reaction containing 10 µg of substrate protein, 40 µCi [γ-32P]ATP 
and 10 U of the catalytic subunit of Ca-independent protein kinase A from bovine heart (Sigma, St. 
Louis, MO) at 30°C for 30 minutes. The 32P-labeled protein was purified through 
glutathione-Sepharose beads to separate uncoupled free nucleotide. In the case of the ASF/SF2 probe, 
pET11a-6H(K)-ASF/SF2 was created by inserting the ASF/SF2 coding region into the vector pET11a-6H(K) 
(Ge et al., Mol. Cell, 2, 751-759, 1998) and overexpressed in Escherichia coli cells. Recombinant 
protein was affinity purified and labeled by HMK in vitro as described above. Pre-treatment of the 
array took place in buffer A100 containing 1% non-fat milk at room temperature for at least 30 
minutes. The array was then incubated with 30-50 ng probe/ml buffer A100 (+1% milk) at 4°C for over 
12 hours. After incubation, the array was sequentially washed with three changes of buffer A100 
(100 mM KCl), A500 (500 mM KCl) and A1000 (1000 mM KCl). The resulting signals were visualized by 
autoradiography (exposure from 30 minutes to 10 hours) and quantified with a densitometer 
(Molecular Dynamics, Sunnyvale, CA). 

Results 

GST-K-p52 was labeled in vitro with [γ-32P]ATP by HMK (Ge et al., Mol. Cell, 2, 751-759, 
1998) and further purified through glutathione-Sepharose beads. The protein array was first treated 
with buffer A100 containing 1% non-fat milk and then incubated with 32P-labeled GST-K-p52 as 
described in Methods and Materials. The filter was extensively washed with buffer A containing 
100, 500 and 1000 mM KCl prior to each autoradiographic analysis. A low salt wash (with 100 mM 
KCl) allowed the detection of most possible interactions (Fig. 1A), while a high salt wash (with 
500-1000 mM KCl) allowed the detection of highly specific and high affinity interactions (Fig. 1B). No 
significant difference was found between the 500 and 1000 mM salt washes. The relative affinity of 
each tested protein for the probe could be measured with either a densitometer or a phosphorimager 
(Fig. 1C). Among all 48 proteins (or fractions), the SR protein fraction (addresses 12c/d) and the 
recombinant GST-nucleolin (addresses 12e/f) had the highest affinities for the transcriptional 
coactivator p52. 

It has previously been shown that, in addition to the ability to interact specifically with a 34 
kDa doublet corresponding to the splicing factor ASF/SF2, p52 could also interact strongly with a 
100 kDa protein found to be present in the SR fraction by far-western blot analysis (Ge et al., Mol. 
Cell, 2, 751-759, 1998). Protein microsequence analysis indicated that the 100 kDa band isolated 
from the SR protein fraction contained two proteins, nucleolin and DNA topoisomerase I (topo I). In 
the present experiment, p52 strongly interacted with the recombinant GST-nucleolin but not with 
topo I, either recombinant proteins expressed in baculovirus (Fig. 1, addresses 11a-f) or naturally 
purified protein from mammalian cells (addresses 11g/h). This observation demonstrates that p52 
interacts with the nucleolin rather than the topo I present in the SR protein fraction, which is 
consistent with the recent observation that nucleolin is a component of the multiprotein complex 
associated with p52 in HeLa cells. 

Nucleolin has been implicated in regulating pre-rRNA processing (Bouvet et al., EMBO J., 
16, 5235-5246, 1997), pre-mRNA splicing (Ishikawa et al., Mol. Cell. Biol., 13, 4301-4310, 1993), B 
cell-specific transcription (Hanakahi et al., Proc. Natl. Acad. Sci. USA, 94, 3605-3610, 1997), 
unwinding DNA, RNA or DNA-RNA duplexes (Tuteja et al., Gene, 28, 143-148, 1995) and 
mediating cell doubling time in human cancer cells (Derenzini et al., Lab. Invest., 73, 497-502, 
1995). Like the splicing factor ASF/SF2, nucleolin also contains RNP type RNA-binding domains as 
well as RGG repeats (Bouvet et al., EMBO J., 16, 5235-5246, 1997; Valdez et al., Mol. Immunol., 
32, 1207-1213, 1995). Its activity can be modulated through mitosis-specific phosphorylation by 
p34cdc2 kinase or casein kinase II (Tuteja et al., Gene, 28, 143-148, 1995). Therefore, it would be 
interesting to further examine the biological significance of nucleolin interaction with the general 
transcriptional coactivator and splicing regulator p52. 

In addition to GST-K-p52, other protein probes have also been tested in the UPA system. 
Figure 3 shows the binding activity of 32P-labeled splicing factor ASF/SF2, a member of the SR 
protein family, to 16 different selected proteins in a 4 by 4 array (Fig. 3A). ASF/SF2 significantly 
bound to five of the 16 proteins, including the affinity-purified TFIID complex (Fig. 3B and C, 
address 2d), retinoid-X receptor (address 3a), histone H1 (address 3c), co-histones (address 3d) and 
ASF/SF2 itself (address 4b). However, after washing the UPA with 500 mM KCl, ASF/SF2 
appeared to have the highest affinity for itself (Fig. 3C, address 4b), which is in agreement with the 
previous observation that in vitro translated ASF/SF2 could strongly bind to GST-ASF/SF2 in a GST 
pull down assay (Xiao and Manley, EMBO J., 17, 6359-6367, 1998). However, ASF/SF2 also 
showed high affinity for the TFIID complex. Since ASF/SF2 did not interact with TBP (address 2c), 
ASF/SF2 might interact directly with TBP-associated factors. Whether such an interaction reflects 
the function of TFIID or ASF/SF2 in transcription or pre-mRNA splicing or coupling of these could 
also be investigated using the disclosed UPA technology. Taken together, these experiments 
demonstrate that UPA can be used to detect protein interactions with various targets. 

Using the same UPA, it is shown that PC4 with a single point mutation (Phe→Pro) at 
position 77 lost both dsDNA- and ssDNA-binding activity (Fig. 2A and B, addresses 8c/d), but still 
retained RNA-binding activity (Fig. 2C, addresses 8c/d). In contrast, phosphorylation of PC4 by 
casein kinase II stimulated the DNA-binding activity (Fig. 2A and B, addresses 6c/d), but reduced its 
RNA-binding activity (Fig. 2C, addresses 6c/d). These observations demonstrate that UPA is an 
effective method to map protein interaction domains and DNA- or RNA-binding domains of a 
protein. 

Example 4: Interaction with a DNA probe. 

Methods and Materials 

To test whether the UPA system could also be used to detect interactions with other (e.g., 
biological) molecules, the same array was stripped (see Example 2) and reprobed with a 32P-labeled 
double-stranded oligonucleotide (64 bp) containing the adenovirus major late core promoter 
elements. 

A double-stranded (ds) oligonucleotide (64 bp with plus strand 
5'-GGGGGGCTATAAAAGGGGGTGGGGGCGCGTTCGTCCTCACTCTCTTCCGCATCGCTGTCTGCG and minus strand 
5'-CCCTCGCAGACAGCGATGCGGAAGAGAGTGAGGACGAACGCGCCCCCACCCCCTTTTATAGCCC) 
corresponding to the adenovirus major late promoter region from -39 to +29 was 
labeled at the 3'-end of the minus strand with Klenow fragment in the presence of [32P]dCTP. After 
labeling, the free nucleotides were separated from the probe by passing the labeling reaction through 
a G-50 nick column (Pharmacia Biotech, United Kingdom). Pre-treatment took place with buffer A 
containing 60 mM KCl, 2x Denhardt's solution and 25 µg/ml poly(dGdC) (Sigma, St. Louis, MO) at 
room temperature for 30 minutes. For interaction, 5 ng/ml of 32P-labeled double-stranded (ds)DNA 
was added to the same buffer and incubation was carried out at 4°C for >12 hours. The array was 
then sequentially washed with three changes of buffer A100, A500 and A1000 followed by 
autoradiography and quantification. 

To analyze the array with a single-stranded (ss)DNA probe, the 64-mer minus strand of the 
dsDNA probe was labeled at the 5'-end by T4 polynucleotide kinase in the presence of [γ-32P]ATP. 
Other conditions were exactly the same as those for the dsDNA probe. 

Results 

The results shown in Figure 2A indicate that, after washing with 500 mM salt, 
phosphorylated PC4 (PC4-P, addresses 6c/d), an inactive form of a previously described 
transcriptional coactivator (Ge et al., Proc. Natl. Acad. Sci. USA, 91, 12691-12695, 1994), purified 
from HeLa cells had the highest affinity for the tested dsDNA probe among 48 samples (see 
quantification in Table 2). PC4-P had 3- to 5-fold higher affinity for dsDNA compared to other PC4 
derivatives, including wild-type PC4 (addresses 9a/b). In contrast, a single amino acid change at 
position 77 (Phe→Pro) completely abolished the dsDNA binding ability of PC4 (addresses 8c/d). 
These results are in agreement with the observations reported recently using gel mobility shift assays 
that phosphorylated PC4 bound bubble DNA with higher affinity and the region around position 77 
was critical for the DNA-binding activity of PC4 (Werten et al., EMBO J., 17, 5103-5111, 1998). 

Although it is known that TBP can specifically bind the present probe, the signal is 
relatively weak compared to other DNA-binding proteins. This result is consistent with the 
observation from gel mobility shift assays that the binding activity of TBP to TATA box-containing 
DNA was barely detectable. However, it can be significantly enhanced by the presence of another 
transcription factor, TFIIA (Orphanides et al., Genes Dev., 10, 2657-2683, 1996). On the other hand, 
many other general (non-sequence-specific) DNA-binding proteins had much stronger 
signals than TBP, suggesting that the present system may not be suitable for determining the binding 
activity of sequence-specific DNA-binding (and/or RNA-binding) proteins. ASF/SF2 was identified 
as an RNA-binding protein playing an essential role(s) in pre-mRNA splicing. Both the recombinant 
ASF/SF2 (addresses 12a/b) and the native ASF/SF2-containing SR protein fraction (12c/d) bound 
dsDNA as well as ssDNA (see below) very strongly, even tighter than most of the DNA-binding 
proteins tested (see quantification in Table 2), indicating that ASF/SF2 is also a DNA-binding 
protein. After the array was analyzed with a ssDNA probe (Fig. 2B), although several differences 
were observed, the overall pattern of protein-ssDNA interactions was similar to that of 
protein-dsDNA interactions, suggesting that most DNA-binding proteins are capable of binding both dsDNA 
and ssDNA. 

Example 5: Interaction with an RNA probe. 

Methods and Materials 

An SV40 early pre-mRNA was synthesized in vitro from the plasmid pSVi66 by SP6 RNA 
polymerase as previously described (Ge et al., Mol. Cell, 2, 751-759, 1998). Interaction was carried 
out at 4°C for >12 hours in the presence of 20 mM HEPES Na pH 7.9, 5% glycerol, 10 mM 
2-mercaptoethanol, 0.2 mM EDTA Na pH 8.0, 60 mM KCl, 2 mM MgCl2, 0.5 mg/ml BSA, 25 µg/ml 
tRNA and ~5 ng/ml 32P-labeled SV40 early pre-mRNA. The array was then sequentially washed and 
visualized by autoradiography as described for the DNA probes. 

Results 

This protein array system was also used successfully to analyze interactions with an RNA 
probe transcribed from the SV40 early region-containing plasmid pSVi66 (Ge et al., Mol. Cell, 2, 
751-759, 1998). Several interesting observations were revealed (see Fig. 2C). First, phosphorylation 
by casein kinase II in vivo apparently decreased the affinity of PC4 for the RNA probe (addresses 
6c/d in Fig. 2C; see also Table 2), although it increased the affinity of PC4 for both the dsDNA and 
ssDNA probes (addresses 6c/d in Fig. 2A and B). Second, in contrast to the DNA-binding activity, 
the RNA-binding activity of PC4 was not significantly affected by the mutation at position 77 
(addresses 8c/d in Fig. 2C). Third, both p52 and p75 strongly bound the RNA probe (addresses 9c-h in 
Fig. 2C), but did not significantly bind either the dsDNA or ssDNA probe in this assay (addresses 
9c-h in Fig. 2A and B). Finally, PCAF, a p300/CBP-associated factor that functions as a histone 
acetyltransferase (Ogryzko et al., Cell, 87, 953-959, 1996), could bind the RNA probe very strongly 
(addresses 10c-f in Fig. 2C; see Table 2 for quantification), suggesting a possible role of PCAF in 
RNA metabolism. 



Example 6: Interaction with a ligand probe 

Methods and Materials 

L-3,5,3'-[125I]Triiodothyronine (T3) was purchased from NEN (Boston, MA, catalog no. 
NEX110H). The interaction conditions were essentially the same as for the RNA probe except that 
tRNA was omitted and 0.3 µCi/ml [125I]T3 was added instead of the RNA probe. 

Results 

This protein array system was also used successfully to analyze interactions with a 
125I-labeled ligand, T3. Only the recombinant thyroid hormone receptor bound 125I-labeled T3 strongly 
and specifically (addresses 3c/d in Fig. 2D). 



Table 2 

No. | Position | Protein/source          | p52 (n)     | dsDNA (n)   | ssDNA (n)   | RNA (n)     | Function/Description
----+----------+-------------------------+-------------+-------------+-------------+-------------+---------------------------------------
 1  | 1a/b     | TFIIA/bacteria (?)      | 12.3        | 0.9         | 15.1        | 0.3         | class II gene transcription
 2  | 1c/d     | TFIIB/bacteria (?)      | 0           | 0           | 0.2         | 0           | class II gene transcription
 3  | 1e/f     | TBP/bacteria (?)        | 5.1         | 0.5         | 5.9         | 35.8        | class II gene transcription
 4  | 1g/h     | f:TFIID/HeLa (b)        | 8           | 3.3         | 1.9         | 12.4        | class II gene transcription
 5  | 2a/b     | TFIIE/bacteria (?)      | 0           | 0           | 0.2         | 0.6         | class II gene transcription
 6  | 2c/d     | TFIIF/bacteria (?)      | 0.7         | 0           | 0.3         | 1.4         | class II gene transcription
 7  | 2e/f     | f:TFIIH/HeLa (c)        | 1.4         | 2.5         | 0.4         | 4.7         | class II gene transcription
 8  | 2g/h     | RNA pol II/HeLa (?)     | 9           | 22.4        | 10.8        | 6.1         | class II gene transcription
 9  | 3a/b     | RXR/bacteria            | 30          | 11.2        | 37.9        | 34.6        | activator (retinoid-X receptor)
10  | 3c/d     | TR/bacteria             | 16.3        | 29.3        | 27.5        | 52.4        | activator (thyroid hormone receptor)
11  | 3e/f     | Oct1/HeLa (?)           | 10.5        | 3.5         | 4.7         | 9           | B cell specific activator
12  | 3g/h     | Sp1/HeLa (c)            | 3.2         | 2.6         | 2.2         | 1.3         | class II gene activator
13  | 4a/b     | G4-94/bacteria          | 0.5         | 0.2         | 1           | 40.2        | activator (DNA binding domain)
14  | 4c/d     | G4-147/bacteria         | 3           | 0.1         | 0.1         | 8.7         | activator (DNA binding domain)
15  | 4e/f     | G4-AH/bacteria (a)      | 1.9         | 0.5         | 0.3         | 3.8         | class II gene activator
16  | 4g/h     | G4-VP16/bacteria        | 1.5         | 1.3         | 0           | 0           | class II gene activator
17  | 5a/b     | G4-CTF/bacteria         | 0.9         | 0           | 0.1         | 8.2         | class II gene activator
18  | 5c/d     | G4-Sp1/bacteria         | 4.5         | 16.6        | 9.3         | 76.7        | class II gene activator
19  | 5e/f     | G4-E1A/bacteria         | 1.8         | 1.2         | 0           | 8           | class II gene activator
20  | 5g/h     | G4-IE/bacteria          | 4           | 0.8         | 0           | 0.9         | class II gene activator
21  | 6a/b     | G4-Tat/bacteria         | 2.3         | 1.6         | 3.4         | 15.7        | class II gene activator
22  | 6c/d     | PC4-P/HeLa (?)          | 5.4         | 100         | 77.5        | 14.5        | coactivator (phosphorylated)
23  | 6e/f     | PC4-N/bacteria          | 16.8        | 2.3         | 0.8         | 10.6        | PC4 (C-terminal deletion)
24  | 6g/h     | PC4-C/bacteria          | 30.3        | 1.6         | 0.1         | 37.5        | PC4 (N-terminal deletion)
25  | 7a/b     | PC4-AS/bacteria         | 3.9         | 35          | 41.6        | 87          | PC4 (CKII sites mutated)
26  | 7c/d     | PC4-m1/bacteria         | 3.9         | 23.2        | 44.4        | 75.1        | PC4 (K23I/K29A)
27  | 7e/f     | PC4-m2/bacteria         | 10.4        | 23.6        | 48.3        | 71.9        | PC4 (K35I/K41A)
28  | 7g/h     | PC4-m3/bacteria         | [illegible] | [illegible] | [illegible] | [illegible] | PC4 ([illegible])
29  | 8a/b     | PC4-m4/bacteria         | [illegible] | [illegible] | [illegible] | 100         | PC4 ([illegible])
30  | 8c/d     | PC4-m5/bacteria         | [illegible] | [illegible] | [illegible] | [illegible] | PC4 (F77P)
31  | 8e/f     | PC4-m6/bacteria         | [illegible] | [illegible] | [illegible] | [illegible] | PC4 ([illegible])
32  | 8g/h     | PC4-m7/bacteria         | [illegible] | [illegible] | [illegible] | [illegible] | PC4 (K41A)
33  | 9a/b     | PC4-wt/bacteria         | [illegible] | 17          | [illegible] | [illegible] | transcriptional coactivator (wild type)
34  | 9c/d     | p52/bacteria (?)        | [illegible] | [illegible] | [illegible] | [illegible] | transcriptional coactivator
35  | 9e/f     | p75/[illegible]         | [illegible] | [illegible] | [illegible] | [illegible] | transcriptional coactivator
36  | 9g/h     | p75-C/bacteria (?)      | [illegible] | [illegible] | [illegible] | [illegible] | coactivator (C-terminal 326-530)
37  | 10a/b    | p300-C/[illegible]      | [illegible] | [illegible] | [illegible] | [illegible] | transcriptional coactivator (1135-2414)
38  | 10c/d    | PCAF/[illegible]        | [illegible] | [illegible] | [illegible] | [illegible] | histone acetyltransferase
39  | 10e/f    | PCAF-C/baculovirus (h)  | 5.3         | 8.4         | 8           | 74.3        | PCAF ([illegible])
40  | 10g/h    | TAF250/baculovirus (i)  | 1.7         | 1.1         | 0.6         | 10.9        | transcriptional coactivator
41  | 11a/b    | Topo I/baculovirus (i)  | 1.1         | 1           | 0.9         | 12.4        | DNA unwinding/transcription
42  | 11c/d    | Topo I/baculovirus (j)  | 4.7         | 2.4         | 2.1         | 12.6        | Topo I (Y723F)
43  | 11e/f    | Topo I/baculovirus (k)  | 5           | 1.6         | 1.1         | 6           | Topo I (wild type)
44  | 11g/h    | Topo I/HeLa             | 2.3         | 1.7         | 0.7         | 76.5        | native Topo I (o)
45  | 12a/b    | ASF/SF2/bacteria (?)    | 13.5        | 33.5        | 89          | 40.4        | splicing factor (SR protein)
46  | 12c/d    | SR/HeLa (l)             | 55          | 80          | 100         | 9.7         | splicing factors (SR family)
47  | 12e/f    | GST-Nu/bacteria (m)     | 100         | 0.4         | 3.3         | 8.3         | pre-rRNA processing factor (nucleolin)
48  | 12g/h    | GST-K/bacteria          | 2.8         | 0.4         | 1.5         | 1.8         | negative control

(a) Ge et al., Methods Enzymol., 274, 57-71, 1996 
(b) Chiang et al., EMBO J., 12, 2749-2762, 1993 
(c) Kershnar et al., J. Biol. Chem., 18, 34444-34453, 1998 
(d) Luo et al., Cell, 71, 231-241, 1992 
(e) Jackson and Tjian, Proc. Natl. Acad. Sci. USA, 86, 1781-1785, 1989 
(f) Ge et al., Proc. Natl. Acad. Sci. USA, 91, 12691-12695, 1994 
(g) Ge et al., Mol. Cell, 2, 751-759, 1998 
(h) Ogryzko et al., Cell, 87, 953-959, 1996 
(i) Mizzen et al., Cell, 87, 1261-1270, 1996 
(j) Wang and Roeder, Mol. Cell, 1, 749-757, 1998 
(k) Pourquier et al., J. Biol. Chem., 272, 26441-26447, 1997 
(l) Zahler et al., Genes Dev., 6, 837-847, 1992 
(m) Valdez et al., Mol. Immunol., 32, 1207-1213, 1995 
(n) Relative binding affinity of the specified probe to each target on the array, normalized to the highest signal for each probe 
(o) TopoGEN Inc., Columbus, OH 



The number, position (address), name/source (and related reference), affinities for each probe 
and known function for each of the 48 target polypeptides are indicated. The highest affinities of the 
individualized proteins for each probe molecule [GST-nucleolin for p52 (addresses 12e/f), PC4-P for 
the dsDNA (addresses 6c/d), SR for the ssDNA (addresses 12c/d) and PC4-m4 for the RNA 
(addresses 8a/b)] were normalized to 100 and are indicated in bold. 
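
The normalization used for Table 2, in which the strongest signal for a given probe is set to 100, can be expressed as a short sketch. The raw densitometer readings below are hypothetical values invented for the example; they were chosen only so that the normalized results resemble three entries of the p52 column. The function name is likewise an assumption.

```python
def normalize_to_highest(raw_signals):
    """Scale raw signals so that the highest signal for the probe equals 100.

    raw_signals -- dict mapping an array address to a raw intensity reading
    Returns a dict of relative affinities rounded to one decimal place.
    """
    highest = max(raw_signals.values())
    if highest <= 0:
        raise ValueError("at least one signal must be positive")
    return {
        address: round(100.0 * value / highest, 1)
        for address, value in raw_signals.items()
    }


# Hypothetical raw readings for one probe at three addresses.
raw_p52 = {"12e/f": 2500.0, "12c/d": 1375.0, "12g/h": 70.0}
relative_p52 = normalize_to_highest(raw_p52)
```

Because each probe is normalized to its own maximum, values are comparable between targets for the same probe, but not directly between probes.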



Example 7: Kits 



UPAs as disclosed herein can be supplied in the form of a kit for use in molecule binding 
analyses. In such a kit, at least one polypeptide array is provided. The kit will also include 
instructions, usually written instructions, to assist the user in probing the array. Such instructions can 
optionally be provided on a computer readable medium. 

Kits may additionally include one or more buffers for use during assay of the provided 
array. For instance, such buffers may include a low stringency wash, a high stringency wash, and/or a 
stripping solution. These buffers may be provided in bulk, where each container of buffer is large 
enough to hold sufficient buffer for several probing or washing or stripping procedures. 
Alternatively, the buffers can be provided in pre-measured aliquots, which would be tailored to the 
size and style of array included in the kit. 

Certain kits may also provide one or more containers in which to carry out array-probing 
reactions. 

Kits may in addition include either labeled or unlabeled control probe molecules, to provide 
for internal tests of either the labeling procedure or probing of the UPA, or both. The control probe 
molecules may be provided suspended in an aqueous solution or as a freeze-dried or lyophilized 
powder, for instance. The container(s) in which the controls are supplied can be any conventional 
container that is capable of holding the supplied form, for instance, microfuge tubes, ampoules, or 
bottles. In some applications, control probes may be provided in pre-measured single use amounts in 
individual, typically disposable, tubes or equivalent containers. 

The amount of each control probe supplied in the kit can be any particular amount, 
depending for instance on the market to which the product is directed. For instance, if the kit is 
adapted for research or clinical use, sufficient control probe(s) likely will be provided to perform 
several controlled analyses of the array. Likewise, where multiple control probes are provided in one 
kit, the specific probes provided will be tailored to the market. In certain embodiments, a plurality of 
different control probes will be provided in a single kit, each control probe being from a different 
class of molecules (e.g., a nucleic acid probe, a protein probe, a ligand probe, etc.). 

In some embodiments of the current invention, kits may also include the reagents necessary 
to carry out one or more probe-labeling reactions. The specific reagents included will be chosen in 
order to satisfy the end user's needs, depending on the type of probe molecule (e.g., nucleic acid, 
polypeptide, or ligand) and the method of labeling (e.g., radiolabel incorporated during probe 
synthesis, attachable fluorescent tag, etc.). 

Further kits are provided for the labeling of probe molecules for use in assaying arrays 
provided herein. Such kits may optionally include an array to be assayed by the so labeled probe 
molecules. Other components of the kit are largely as described above for kits for the assaying of 
UPAs. 

In view of the many possible embodiments to which the principles of our invention may be 
applied, it should be recognized that the illustrated embodiments are only certain examples of the 
invention and should not be taken as a limitation on the scope of the invention. Rather, the scope of 
the invention is defined by the following claims. We therefore claim as our invention all that comes 
within the scope and spirit of these claims. 

We claim: 

1. A protein interaction assay comprising: 

contacting an array of substantially pure target polypeptide molecules stably 
associated with a surface of a solid support with a detectable probe molecule under conditions 
sufficient to produce binding; and 

detecting the binding. 

2. The assay of claim 1, further comprising removing unbound probe molecule prior 
to detecting the binding. 

3. The assay of claim 1, wherein the detectable probe molecule comprises a single- 
stranded nucleic acid, a double-stranded nucleic acid, a protein, or a ligand. 

4. The assay of claim 1, wherein the detectable probe molecule comprises a tag useful 
for detection. 

5. The assay of claim 4, wherein the tag is fluorescent, luminescent, or immunogenic. 

6. The assay of claim 1, wherein the array comprises a microarray. 

7. The assay of claim 1, wherein the polypeptides are associated with the support at 
discrete addresses. 

8. The assay of claim 7, wherein each address contains only one substantially pure 
target polypeptide. 

9. The assay of claim 1, wherein the binding detected is a binding pattern. 

10. An assay to determine polypeptide-binding of a probe molecule, comprising: 

(a) preparing a labeled sample of the probe molecule; 

(b) contacting the labeled sample with an array of substantially pure target 
polypeptides stably associated with the surface of a solid support under conditions sufficient to 
produce binding; 

(c) separating unbound labeled probe from the array to produce a probed array; and 

(d) detecting the binding. 

11. The assay of claim 10, further comprising contacting the probed array with at least 
one additional member of a signal producing system. 

12. The assay of claim 11, wherein the contacting the probed array with at least one 
additional member of a signal producing system is prior to detecting the binding. 

13. A universal protein array, comprising 

a plurality of substantially pure target polypeptide samples provided on a solid 
support, 

wherein the samples are immobilized on the solid support in an addressable pattern. 

14. The array of claim 13, wherein each address contains only one substantially pure 
target polypeptide. 
15. The array of claim 13, wherein the addresses are arranged in rows and columns. 

16. The array of claim 13, wherein the array is arranged in a computer readable format. 

17. The array of claim 13, comprising at least 10 different polypeptide samples. 

18. The array of claim 13, comprising at least 30 different polypeptide samples. 

19. The array of claim 13, comprising at least 100 different polypeptide samples. 

20. The array of claim 13, wherein the array comprises a microarray. 

21. The array of claim 13, wherein the solid support comprises glass, nitrocellulose, 
polyvinylidene fluoride, nylon, fiber, or combinations thereof. 

22. The array of claim 13, wherein the polypeptides comprise transcriptional factors, 
transcriptional activators, or transcriptional coactivators. 

23. The array of claim 22, wherein the polypeptides comprise TFIIA, TFIIB, TBP, 
f:TFIID, TFIIE, TFIIF, f:TFIIH, Pol II, RXR, TR, Oct 1, Sp1, G4-94, G4-147, G4-AH, G4-VP16, 
G4-CTF, G4-Sp1, G4-E1A, G4-IE, G4-Tat, PC4-P, PC4-N, PC4-C, PC4-AS, PC4-m1, PC4-m2, 
PC4-m3, PC4-m4, PC4-m5, PC4-m6, PC4-m7, PC4-wt, p52, p75, p75-C, p300-C, PCAF, PCAF-C, 
TAF250, Topo I (wt), Topo I (mt), Topo I (wt)*, Topo I (nati), ASF, SR, GST-Nu, or GST-K. 

24. A kit for determining polypeptide-binding of a probe molecule, comprising 
a polypeptide array; and 

instructions. 

25. The kit of claim 24, wherein the instructions include directions for exposing the 
probe molecule to an array of substantially pure polypeptides on a support under conditions in which 
the probe molecule is capable of binding to one or more of the polypeptides of the support to detect 
biological interactions between the probe molecule and the one or more polypeptides. 

26. The kit of claim 24, wherein the polypeptide array comprises a microarray. 

27. The kit of claim 24, further comprising a buffer. 

28. The kit of claim 24, wherein the polypeptide array comprises a plurality of 
substantially pure polypeptide samples. 

29. The kit of claim 24, further comprising a probe molecule standard. 

30. The kit of claim 29, wherein the probe molecule standard comprises a label. 

31. The kit of claim 28, wherein the substantially pure polypeptides comprise 
transcriptional factors, transcriptional activators, or transcriptional coactivators. 

32. The kit of claim 31, wherein the polypeptides comprise TFIIA, TFIIB, TBP, 
f:TFIID, TFIIE, TFIIF, f:TFIIH, Pol II, RXR, TR, Oct 1, Sp1, G4-94, G4-147, G4-AH, G4-VP16, 
G4-CTF, G4-Sp1, G4-E1A, G4-IE, G4-Tat, PC4-P, PC4-N, PC4-C, PC4-AS, PC4-m1, PC4-m2, 
PC4-m3, PC4-m4, PC4-m5, PC4-m6, PC4-m7, PC4-wt, p52, p75, p75-C, p300-C, PCAF, PCAF-C, 
TAF250, Topo I (wt), Topo I (mt), Topo I (wt)*, Topo I (nati), ASF, SR, GST-Nu, or GST-K. 

33. A method of analysis of protein-molecule interactions, comprising: 
obtaining a plurality of different substantially pure protein specimens; 
placing a sample of each specimen in a discrete addressable location on a recipient 
array; and 

probing the array with a detectable probe molecule. 

34. The method of claim 33, wherein the array comprises a microarray. 

35. The method of claim 33, wherein the probe molecule comprises a nucleic acid, a 
polypeptide, a ligand, a fragment thereof, or mixtures thereof. 

36. A method of analyzing a plurality of binding characteristics of an array of 
polypeptide samples, comprising: 

(a) providing a protein array comprising a plurality of different polypeptide 
samples; 

(b) exposing the protein array to a first probe that may interact with the samples of 
the universal protein array to identify those samples to which the first probe binds; 

(c) detecting a first binding pattern of the first probe; 

(d) repeating (b) through (c) with a second probe to identify samples to which the 
second probe binds. 

37. The method of claim 36, further comprising stripping bound first probe from the 
array prior to exposing the array to the second probe. 

38. The method of claim 36, wherein the protein array comprises 

a plurality of substantially pure target polypeptide samples; and 
a solid support, 

wherein the samples are immobilized on the solid support in an addressable pattern. 

39. The method of claim 36, wherein the first probe and the second probe are selected 
from different classes of molecules. 

40. The method of claim 36, wherein the protein array comprises a microarray. 

41. The assay of claims 1 or 10, wherein detection is automated. 

42. The method of claim 36, wherein detection is automated. 
FIG. 3A 